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Abstract 

The paper tackles the power of randomization in the context of locality by analyzing the ability to 
"boost" the success probability of deciding a distributed language. The main outcome of this analysis is 
that the distributed computing setting contrasts significantly with the sequential one as far as randomiza- 
tion is concerned. Indeed, we prove that in some cases, the ability to increase the success probability for 
deciding distributed languages is rather limited. 

Informally, a (p, q)-decider for a language £ is a distributed randomized algorithm which accepts 
instances in L with probability at least p and rejects instances outside of L with probability at least q. 
It is known that every hereditary language that can be decided in t rounds by a (p, q) -decider, where 
p 2 + q > 1, can actually be decided deterministically in 0(t) rounds. In one of our results we give 
evidence supporting the conjecture that the above statement holds for all distributed languages and not 
only for hereditary ones. This is achieved by considering the restricted case of path topologies. 

We then turn our attention to the range below the aforementioned threshold, namely, the case where 
p 2 + q < 1. For k E N* U {oo}, we define the class Bf.[t) to be the set of all languages decidable 
in at most t rounds by a (p, g)-decider, where p 1+ ^ + q > 1. It is easy to see that every language is 
decidable (in zero rounds) by a (p, g)-decider satisfying p + q = 1. Hence, the hierarchy B k provides a 
spectrum of complexity classes between determinism (k = 1, under the above conjecture) and complete 
randomization (k = oo). We prove that all these classes are separated, in a strong sense: for every integer 
k > 1, there exists a language L satisfying L G B k+ i(0) but L £ B k (t) for any t — o(n). In addition, 
we show that £?oo(i) does not contain all languages, for any t — o(n). In other words, we obtain the 
following hierarchy: 

Bi{t) C B 2 {t) C ■ ■ ■ C floo(t) C All . 

Finally, we show that if the inputs can be restricted in certain ways, then the ability to boost the success 
probability becomes almost null, and, in particular, derandomization is not possible even beyond the 
threshold p 2 + q = 1. 

All our results hold with respect to the LOCAL model of computation as well as with respect to the 
CONG£ST{B) model, for B = 0(1). 

1 Introduction 

1.1 Background and Motivation 

The impact of randomization on computation is one of the most central questions in computer science. 
In particular, in the context of distributed computing, the question of whether randomization helps in im- 
proving locality for construction problems has been studied extensively. While most of these studies were 
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problem-specific, several attempts have been made for tackling this question from a more general and unified 
perspective. For example, Naor and Stockmeyer [26] focus on a class of problems called LCL (essentially 
a subclass of the class LD discussed below), and show that if there exists a randomized algorithm that con- 
structs a solution for a problem in LCL in a constant number of rounds, then there is also a constant time 
deterministic algorithm constructing a solution for that problem. 

Recently, this question has been studied in the context of local decision, where one aims at deciding 
locally whether a given global input instance belongs to some specified language ||T3l . The localities of 
deterministic algorithms and randomized Monte Carlo algorithms are compared in |[T3l . in the LOCAL 
model (cf. [28)). One of the main results of |[T3l is that randomization does not help for locally deciding 
hereditary languages if the success probability is beyond a certain guarantee threshold. More specifically, 
a (p, q)-decider for a language £ is a distributed randomized Monte Carlo algorithm that accepts instances 
in L with probability at least p and rejects instances outside of L with probability at least q. It was shown 
in lfT3l that every hereditary language that can be decided in t rounds by a (p, g)-decider, where p 2 + 
q > 1, can actually be decided deterministically in 0(t) rounds. On the other hand, lfT3l showed that the 
aforementioned threshold is sharp, at least when hereditary languages are concerned. In particular, for every 
p and q, where p 2 + q < 1, there exists an heredirtary language that cannot be decided deterministically in 
o{n) rounds, but can be decided in zero number of rounds by a (p, q) -decider. 

In one of our results we provide evidence supporting the conjecture that the above statement holds for 
all distributed languages and not only for hereditary ones. This is achieved by considering the restricted 
case of path topologies. In addition, we present a more refined analysis for the family of languages that 
can be decided randomly but not deterministically. That is, we focus on the family of languages that can 
be decided locally by a (p, q) -decider, where p 2 + q < 1, and introduce an infinite hierarchy of classes 
within this family, characterized by the specific relationships between the parameters p and q. As we shall 
see, our results imply that the distributed computing setting contrasts significantly with the sequential one 
as far as randomization is concerned. Indeed, we prove that in some cases, the ability to increase the success 
probability for deciding distributed languages is very limited. 



1.2 Model 

We consider the LOCAL model (cf. ll28l ). which is a standard distributed computing model capturing the 
essence of spatial locality. In this model, processors are woken up simultaneously, and computation proceeds 
in fault-free synchronous rounds during which every processor exchanges messages of unlimited size with 
its neighbors, and performs arbitrary computations on its data. It is important to stress that all the algorithmic 
constructions that we employ in our positive results use messages of constant size (some of which do not 
use any communication at all). Hence, all our results apply not only to the LOCAL model of computation 
but also to more restricted models, for example, the COAfQSST(B) modeQ where B = 0(1). 

A distributed algorithm A that runs on a graph G operates separately on each connected component of 
G, and nodes of a component C of G cannot distinguish the underlying graph G from C. For this reason, 
we consider connected graphs only. 

We focus on distributed decision tasks. Such a task is characterized by a finite or infinite set S of 
symbols (e.g., £ = {0, 1}, or £ = {0, 1}*), and by a distributed language L defined on this set of symbols 
(see below). An instance of a distributed decision task is a pair (G, x) where G is an n-node connected 
graph, and x G S n , that is, every node v G V(G) is assigned as its local input a value x(v) G S. (In some 
cases, the local input of every node is empty, i.e., £ = {e}, where e denotes the empty binary string.) We 
define a distributed language as a decidable collection L of instance^] 

'Essentially, the CONQ£ST{B) model is similar to the HOC AC model, except that the message size is assumed to be bounded 
by at most B bits (for more details, see [28|). 

2 Note that an undecidable collection of instances remains undecidable in the distributed setting too. 
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In the context of distributed computing, each processor must produce a boolean output, and the decision 
is defined by the conjunction of the processors outputs, i.e., if the instance belongs to the language, then 
all processors must output "yes", and otherwise, at least one processor must output "no". Formally, for a 
distributed language £, we say that a distributed algorithm A decides C if and only if for every instance 
(G, x) and id-assignment Id, every node v of G eventually terminates and produces an output denoted 
out_4(G, x, Id, v), which is either "yes" or "no", satisfying the following decision rules: 

• If (G, x) E C then out_4(G, x, Id, v) = "yes" for every node v E V(G) ; 

• If (G, x) ^ C then out_4(G, x, Id, v) = "no" for at least one node v E V(G) . 

Observe that decision problems provide a natural framework for tackling fault-tolerance: the processors 
have to collectively check whether the network is fault-free, and a node detecting a fault raises an alarm. In 
fact, many natural problems can be phrased as decision problems, for example: "is the network planar?" or 
"is there a unique leader in the network?". Moreover, decision problems occur naturally when one aims at 
checking the validity of the output of a computational task, such as "is the produced coloring legal?", or "is 
the constructed subgraph an MST?". 

The class of decision problems that can be solved in at most t communication rounds is denoted by 
LD(i), for local decision. More precisely, let t be a function of triplets (G,x,Id), where Id denotes the 
identity assignment to the nodes of G. Then LD(t) is the class of all distributed languages that can be de- 
cided by a distributed algorithm that runs in at most t communication rounds. The randomized (Monte Carlo 
2-sided error) version of the class LD(i) is denoted BPLD(t,p, q), which stands for bounded-error proba- 
bilistic local decision, and provides an analog of BPP for distributed computing, where p and q respectively 
denote the yes-error and the no-error guarantees. More precisely, a randomized distributed algorithm is a 
distributed algorithm A that enables every node v, at any round r during its execution, to generate a certain 
number of random bits. For constants p,q E (0, 1], we say that a randomized distributed algorithm A is 
a (p, q)-decider for C, or, that it decides C with "yes" success probability p and "no" success probability 
q, if and only if for every instance (G, x) and id-assignment Id, every node of G eventually terminates and 
outputs "yes" or "no", and the following properties are satisfied: 

• If (G,x) E C then Pr[Vu E V(G), 

ouU(G, x, Id, v) = "yes"] > p ; 

• If (G, x) £ £ then Pi[3v E V(G), 

out A (G,x,ld,v) = "no"] > q . 

The probabilities in the above definition are taken over all possible coin tosses performed by the nodes. 
The running time of a (p, g)-decider executed on a node v depends on the triple (G, x, Id) and on the results 
of the coin tosses. In the context of a randomized algorithm, T V (G, x, Id) denotes the maximal running time 
of the algorithm on v over all possible coin tosses, for the instance (G, x) and id-assignment Id. Now, just 
as in the deterministic case, the running time T of the (p, g)-decider is the maximum running time over all 
nodes. Note that by definition of the distributed Monte-Carlo algorithm, both T v and T are deterministic. 
For constant p,q G (0, 1] and a function t of triplets (G, x, Id), BPLD(t, p, q) is the class of all distributed 
languages that have a randomized distributed (p, q)-decider running in time at most t (i.e., can be decided 
in time at most t by a randomized distributed algorithm with "yes" success probability p and "no"success 
probability q). 

Our main interest within this context is in studying the connections between the classes BPLD(t, p, q). In 
particular, we are interested in the question of whether one can "boost" the success probabilities of a (p, q)- 
decider. (Recall that in the sequential Monte Carlo setting, such "boosting" can easily be achieved by repeat- 
ing the execution of the algorithm a large number of times.) Our starting point is the recent result of [ 13] that, 
for the class of hereditary languages (i.e., closed under sub-graphs), the relation p 2 + q = 1 is a sharp thresh- 
old for randomization. That is, for hereditary languages, [J p 2 +q> i BPLD(t,p, q) collapses to LD(0(i)), but 
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for any p,q £ (0, 1] such that p 2 + q < 1 there exists a language L G BPLD(0,p, q), while L ^ LD(t) 
for any t = o(n). We conjecture that the hereditary assumption can be removed and we give some evi- 
dence supporting this conjecture. Aiming at analyzing the collection of classes U p 2 +g<1 BPLD(i,p, q), we 
consider the following set of classes: 

B k (t)= |J BPLD(i,p,g) 

pl + l/fc+q>l 

for any positive integer k, as well as the class 

#00(0= (J BFLD(t,p,g). 

P+g>l 

Hence, our conjecture states that Bi(t) = LD(0(t)). Note that the class U p +q>i BPLD(0,p, q) contains all 
languages, using a (1, 0)-decider that systematically returns "yes" at every node (without any communica- 
tion). Hence, the classes provide a smooth spectrum of randomized distributed complexity classes, from 
the class of deterministically decidable languages (under our conjecture) to the class of all languages. The 
ability of boosting the success probabilities of a (p, g)-decider is directly related to the question of whether 
these classes are different, and to what extent. 

1.3 Our results 

One of the main outcomes of this paper is a proof that boosting success probabilities in the distributed 
setting appears to be quite limited. By definition, Bk(t) C Bk+i(t) for any k and t. We prove that these 
inclusions are strict. In fact, our separation result is much stronger. We prove that there exists a language in 
.Bfc_l_i(0) that is not in B^{t) for any t = o(n). Moreover, we prove that Tree ^ Boo(t) for any t = o(n), 
where Tree = {(G, e) : G is a tree}. Hence, B^t) does not contain all languages, even for t = o(n). In 
summary, we obtain the following hierarchy. 

fli(t)Cfl2(t)C---Cfloo(t)cAll. 

These results demonstrate that boosting the probability of success might be doable, but only from a (p, q) 
pair satisfying p l+l K k+1 ) + q > 1 to a (p, q) pair satisfying p 1+l / k + q > 1 (with the extremes excluded). 
It is an open question whether Bk+i(t) actually collapses to BPLD (0(t),p, q), where p l+l / k + q = 1, or 
whether there exist intermediate classes. 

Recall that every hereditary language in B\(t) is also in LD(0(t)) [13]. We conjecture that this deran- 
domization result holds for all languages and not only for hereditary ones. We give evidence supporting 
this conjecture by showing that restricted to path topologies, finite input and constant running time t, the 
statement B\{t) C LD(0(t)) holds without assuming the hereditary property. This evidence seems to be 
quite meaningful especially since all our separation results hold even if we restrict ourselves to decision 
problems on path topologies. 

Finally, we show that the situation changes drastically if the distribution of inputs can be restricted in 
certain ways. Indeed, we show that for every two reals < r < r', there exists a language in C r > (0) that is 
not in C r (t) for any t = o(n), where the C-classes are the extension of the S-classes to decision problems 
in which the inputs can be restricted. 

All our results hold not only with respect to the LOCAL model but also with respect to more restrictive 
models of computation such as the COMQSST(B) model (for B = 0(1)). 
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1.4 Related work 



The notion of local decision and local verification of languages has received quite a lot of attention recently. 
In the LOCAL model, for example, solving a decision problem requires the processors to independently in- 
spect their local neighborhood and collectively decide whether the global instance belongs to some specified 
language. Inspired by classical computation complexity theory, Fraigniaud et al j[T3l suggested that the study 
of decision problems may lead to new structural insights also in the more complex distributed computing 
setting. Indeed, following that paper, efforts were made to form a fundamental computational complexity 
theory for distributed decision problems in various other aspects of distributed computing |[T3l[T4l[T5l[T6l . 

The classes LD, NLD and BPLD defined in [13] are the distributed analogues of the classes P, NP and 
BPP, respectively. The contribution of lfT3l is threefold: it establishes the impact of nondeterminism, ran- 
domization, and randomization + nondeterminism, on local computation. This is done by proving structural 
results, developing a notion of local reduction and establishing completeness results. One of the main results 
is the existence of a sharp threshold for randomization, above which randomization does not help (at least 
for hereditary languages). More precisely the BPLD classes were classified into two: below and above the 
randomization threshold. The current paper "zooms" into the spectrum of classes below the randomization 
threshold, and defines a hierarchy of an infinite set of BPLD classes, each of which is separated from the 
class above it in the hierarchy. 

The question of whether randomization helps in improving locality for construction problems has been 
studied extensively. Naor and Stockmeyer ll26ll considered a subclass of LD(0(1)), called LClj^J and studied 
the question of how to compute in 0(1) rounds the constructive versions of decision problems in LCL. The 
paper demonstrates that randomization does not help, in the sense that if a problem has a local Monte 
Carlo randomized algorithm, then it also has a local deterministic algorithm. There are several differences 
between the setting of [26] and ours. First, [26] considers the power of randomization for constructing a 
solution, whereas we study the power of randomization for deciding language^] Second, while [26 ] deals 
with constant time computations, our separation results apply to arbitrary time computations, potentially 
depending on the size of the instance (graph and input). To summarize, the different settings imply different 
impacts for randomization: while this current paper as well as [ 13] show that randomization can indeed help 
for improving locality of decision problems, [26] shows that when it comes to constructing a solution for a 
problem in LCL in constant time, randomization does not help. The question of whether randomization helps 
for constructing solutions to some specific problems in localized computational models such as MIS, (A + 
1) -coloring, and maximal matching has been also studied in (2l[5l|23l|24j|25l|27l|29l. The original theoretical 
basis for nondeterminism in local computation was laid by the theories of proof-labeling schemes ifTTl [T9l 
|20l|2T]|, which resemble the notion of NLD, and local computation with advice (7J[l0l[TTl|T3. These notions 
also bear some similarities to the notions of local detection 0], local checking |@), or silent stabilization 0, 
which were introduced in the context of self-stabilization [ 8 ] . In addition, NLD seems to be related also to 
the theory of lifts 0. 

Finally, the classification of decision problems in distributed computing has been studied in several other 
models. For example, |6l and ifTSl study specific decision problems in the CONGEST model. In addition, 
decision problems have been studied in the asynchrony discipline too, specifically in the framework of wait- 
free computation 1 151 PT6l and mobile agents computing lfl4l . In the wait-free model, the main issues are not 
spatial constraints but timing constraints (asynchronism and faults). The main focus of |[T6l is deterministic 
protocols aiming at studying the power of the "decoder", i.e., the interpretation of the results. While this 

3 LCL is essentially LD(0(1)) restricted to languages involving graphs of constant maximum degree and processor inputs taken 
from a set of constant size. 

4 There is a fundamental difference between such tasks when locality is concerned. Indeed, whereas the validity of constructing 
a problem in LCL is local (by definition), the validity in our setting is "global", in the sense that in an illegal instance, it is sufficient 
that at least one vertex in the entire network outputs "no". 
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paper essentially considers the AND-checker, (as a global "yes" corresponds to all processes saying "yes"), 
[ 16 ] deals with other interpretations, including more values (not only "yes" and "no"), with the objective of 
designing checkers that use the smallest number of values. 



2 Preliminaries 

This section recalls some previous results from the literature, to be used throughout in the paper. Let us 
first recall that in the LOCAL (respectively COMQ£ST(B)) model, processors perform in synchronous 
rounds, and, in each round, every processor (1) sends messages of arbitrary (resp., 0(B) bits) size to its 
neighbors, (2) receives messages from its neighbors, and (3) performs arbitrary individual computations. 
After a number of rounds (that may depend on the network G connecting the processors, and may vary 
among the processors, since nodes have different identities, potentially different inputs, and are typically 
located at non-isomorphic positions in the network), every processor v terminates and generates its output. 

Consider a distributed (p, g)-decider A running in a network G with input x and identity assignment 
Id (assigning distinct integers to the nodes of G). The output of processor v in this scenario is denoted by 
out^(G, x, Id, v), or simply out(u) when the parameters are clear from the context. In the case of decision 
problem, out(i>) £ {"yes", "no"} for every processor v. 

An n-node path P is represented as a sequence P = (1, . . . , n), oriented from left to right. (However, 
node i does not know its position in the path.) Given an instance (P, x) with ID's Id and a subpath S C 
P, let xs (respectively Ids) be the restriction of x (resp., Id) to S. We sometimes refer to subpath S = 
(ui, . . . ,Uj) C P as S = [i, j]. For a set U C V(G), let £(G, x, Id, U) denote the event that, when running 
A on (G,x) with id-assignment Id, all nodes in U output "yes". Given a language L, an instance (G,x) is 
called legal iff (G, x) £ L. 

Given a time bound t, a subpath S = [i, j] is called an internal subpath of P if % > t+2 and j < n—t—1. 
Note that if the subpath S is internal to P, then when running a i -round algorithm, none of the nodes in S 
"sees" the endpoints of P. 

We now define an important concept, which is crucial in the proofs of our separation results. 

Definition 2.1 Let S be a subpath of P. For 5 € [0, 1], S is said to be a (5, A)-secure subpath if\S\ > A 
and Pr[£(P,x, Id, V(S))] > l-S. 

We typically use (5, A)-secure subpaths for values of A > 2t + 1 where t is the running time of the 
(p, g)-decider A on (P, x) for some fixed identity assignment Id. Indeed, it is known [ 13 ] that if (P, x) £ L, 
then every long enough subpath S of P contains an internal (5, A)-secure subpath S'. More precisely, define 

£(S, A) = 4(A + 2t) \\ogp/ log(l - 5)] . (1) 

We have the following: 

Fact 2.2 ( 11131 ) Let (P, x) £ L, 5 € [0, 1], A > 1. Then for every £(5, X)-length subpath S there is a subpath 
S' (internal to S) that is (5, X)-secure. 

For completeness, we provide the proof of this fact in the Appendix. To avoid cumbersome notation, when 
A = 2t + 1, we may omit it and refer to (5, 2t + Insecure subpaths as 5-secure subpaths. In addition, set 

1(5) := £(5,2t + l). 



Let us next illustrate a typical use of Fact 2.2 Recall that t denotes the running time of the (p, g)-decider A 



on (P, x) £ L with IDs Id. Let S be a subpath of P of length £(5). Denote by L (resp., R) the subpath of 
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P to the "left" (resp., "right") of S. Informally, if the length of S is larger than 2t + 1, then S serves as a 
separator between the two subpaths L and R. This follows since as algorithm A runs in t rounds, each node 
in P is affected only by its t neighborhood. As the t neighborhood of every node «el and v G R do not 
intersect, the events £(P, x, Id, L) and £(P, x, Id, R) are independent. 

The secureness property becomes useful when bounding the probability that at least some node in 
P says "no". A natural approach to upper bound this probability is by applying a union bound on the 
events £ (P, x,Id, V(L) U V(R)) and £ (P, x,Id, V (£)). Letting £' denote the event complementary to 
£{P,x,ld,V(P)), we have 

Pr[£'} = l-Pr[£(P,x,Id,y(P))] 

< (l-Pv[£(P,x,ld,V(L))] 

■Pv[£(P,x,ld,V(R))}) 
+ (l-Pr[£(P,x,ld,V(S))}) 

< l-Pv[£(P,x,ld,V(L))} 

■Pr[£(P,x,ld,V(R))]+5 . 

The specific choice of A and 5 depends on the context. Informally, the guiding principle is to set 5 small 
enough so that the role of the central section S can be neglected, while dealing separately with the two 
extreme sections L and R become manageable for they are sufficiently far apart. 



3 The Bk hierarchy is strict 

In this section we show that the classes B k , k > 1, form an infinite hierarchy of distinct classes, thereby 
proving that the general ability to boost the probability of success for a randomized decision problem is 
quite limited. In fact, we show separation in a very strong sense: there are decision problems in B k+ ±(0), 
i.e., that have a (p, g)-decider running in zero rounds with + q > 1, which cannot be decided by 

a (p, g)-decider with p 1+1 / k + g > 1, even if the number of rounds of the latter is as large as n 1 ^ 6 for every 
fixed e > 0. 

Theorem 3.1 B k +i(0) \ Bk{t) ^ for every k > 1 and every t = o{n). 

Proof: Let k be any positive integer. We consider the following distributed language, which is a generalized 
variant AMOS-fc of the problem AMOS introduced in O. As in AMOS, the input x of AMOS -A; satisfies 
x G {0, l} n , i.e., each node v is given as input a boolean x(v). The language AMOS-fe is then defined by: 

At-Most-k-Selected (AMOS-fc) = 
{(G,x) s.t. || x (I, < k}. 

Namely, AMOS-fc consists of all instances containing at most k selected nodes (i.e., at most k nodes with 



input 1), with all other nodes unselected (having input 0). In order to prove Theorem 3.1 we show that 
AMOS-A; G B k+1 (0) \ B k (t) for every t = o(n). 

We first establish that AMOS-A; belongs to B k+1 (0). We adapt algorithm A presented in flj) for AMOS 
to the case of AMOS-A;. The following simple randomized algorithm runs in time: every node v which 
is not selected, i.e., such that x(v) = 0, says "yes" with probability 1; and every node which is selected, 
i.e., such that x(v) = 1, says "yes" with probability and "no" with probability 1 — p l / k . If the 
graph has s < k nodes selected, then all nodes say "yes" with probability p s / k > p, as desired. On 
the other hand, if there are s > k + 1 selected nodes, then at least one node says "no" with probability 



7 



1 _ p s / k > 1 — p( k + 1 )/ k = i — p 1 + 1 / fe . We therefore get a (p, g)-decider with p 1+1 / k + q >1, that is, such 
thatp 1+1 /( fc+1 ) + q > 1. Thus AMOS-A; G Bjfe +1 (0). 

We now consider the harder direction, and prove that AMOS-A; ^ Sfc(i), for any £ = o(n). To prove 
this separation, it is sufficient to consider AMOS-A; restricted to the family of n-node paths. Fix a function 
£ = o(n), and assume, towards contradiction, that there exists a distributed (p, g)-decider A for AMOS-A; 
that runs in 0(t) rounds, with p 1+1 / k + q > 1. Let e G (0, 1) be such that p 1 + 1 / fc + e + g > 1. Let P be an 
n-node path, and let S C P be a subpath of P. Let 5 G [0, 1] be a constant satisfying 

0<5<p 1+1/k (l-p £ )/k. (2) 

Consider a positive instance and a negative instance of AMOS-A;, respectively denoted by 

I = (P,x) and I' = (P,x') . 

Both instances are defined on the same n-node path P, where n > k (£(5) + 1) + 1. 

Recall that 1(5) = £(5, 2£ + 1) (see Eq. ([!])). We consider executions of A on these two instances, where 
nodes are given the same id's. Both instances have almost the same input. In particular, the only difference 
is that instance / contains k selected nodes, whereas I' has the same selected nodes as / plus one additional 
selected node. Therefore I is legal, while I' is illegal. In P, the path P is composed of k + 1 sections, 
each containing a unique selected node, and where each pair of consecutive sections separated by a <5-secure 
subpath. More precisely, let us enumerate the nodes of P from 1 to n, with node v adjacent to nodes v — 1 
and v + 1, for every 1 < v < n. Consider the k subpaths of P defined by: 

Si = [(i-l)£(5) + i + l,i-£(5)+i] 

fori = {1, . . . , At}. Let the selected nodes in P be positioned as follows. Let u\ = 1 and let Ui = 
(i - l)£{5) + i for i = 2, . . . , k + 1. Then set 



x» 



1, if v = ui for some i G {1, k + 1} 
0, otherwise. 



See Fig. [^a) for a schematic representation of P. Our next goal is to define the legal instance / = (P, x). 
To do so, we begin by claiming that each Si contains a 5-secure internal subpath S^ = [oj, bj\. Naturally, we 



would like to employ Fact 2.2 However, Fact 2.2 refers to subpaths of valid instances (P, x) G C, and I 1 is 
illegal. So instead, let us focus on the instance (Si,x' Si ). Since (Si,x' s ,) contains no leaders, || x' s _ |^ = 0, 
it follows that (Si, x' s .) G £, and Fact 2.2 can be applied on it. Subsequently, since \Si\ > 1(5) it follows 



that Si contains an internal <5-secure subpath S[ = [oj, bj\, whose £ neighborhood is strictly in Si. Therefore, 
when applying algorithm A on (Si,x' s ., IdgJ and on (P, x',Id), the nodes in the (2£ + l)-length segment 
SI behave the same, thus Pr[<S(P,x / ,Id, = Pr^^x'^Id^, V(5{))]. Hence, is a <5-secure 

subpath in (P, x', Id) as well, for every i G {1, A;}, see Fig. j^b). 

The J-secure subpaths S'-'s are now used to divide P into 2A; + 1 segments. Specifically, there are k + 1 
segments Tj, i = 1, . . . , k + 1, each with one selected node. The 5-secure subpaths S' { = {en, bj\ separate Tj 
from Ti + \. More precisely, set T% = [l,ai — l],Tj = [6j_i + l, a^ — 1] for i G 2, k, and T^+i = [b^ + 1, n], 
getting 

P = Ti o S[ o T 2 o S' 2 o . . . o T fc o S' k o T k+ i 

where o denotes path concatenation. Let % = £ (P, x', Id, V(Ti)) be the event that all nodes in the subpath 
Ti say "yes" in the instance I', for i G {1, k + 1} and let pi = Pr[7i] be its probability. Let j be such 
that pj = maxj pi. We are now ready to define the valid instance / = (P, x): 

1, if v = Ui for some i G {1, k + 1}, 
x(v) = { i^j 

1 otherwise. 
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Note that || x' || x = k + 1 and || X || x = k, thus I G AMOS-A; while /' ^ AMOS-A;. See Fig. ^c,d) for an 
illustration of / versus I'. 

We now make the following observation. 

Claim 3.2 V* + j, Pr[£(P, x, Id, V(Tj)] = Pi . 

Proof: This follows since the distance between any two nodes u (resp., v) in distinct T[s is greater than i, 
which implies that x(Lj oTj o Rj) = x'(Lj oTj o Rj) where Li (resp., Ri) is the subpath of length t to the left 
(resp., to the right) of in P, from which it follows that under A the nodes of Tj have the same behavior in 
both instances / and I'. | 

Let M (resp., M') be the event that there exists at least one node in / (resp., /') that says "no" when apply- 
ing algorithm A. Similarly, let y (resp., y') be the event stating that all nodes in the configuration / (resp., 
/') say "yes". Let T = Ui=i T% t> e tne e vent that all nodes in each subpaths Tj, for i G {1, k + 1} say 
"yes" in the instance For every i G {1, k}, let Si = £{P, x', Id, V(S'i)) be the event that all nodes in 
the 5-secure subpath 5- say "yes" in the instance I'. We have Pr(^) = Pr[£(P, x, Id, V(P))\ and Pr(y ) = 
Pr[£(P, x', Id, V{P)), while Pr(W) = 1 - Pr(^) andPr(AA') = 1 - Pr(y). 

Since A is a (p, q)-decider, as we assume by contradiction that AMOS-A; in B^, we have Pr(A/"') > q, 
and thus Pr(AA') > 1 - pi+V^+e. Therefore, Pr(y) < pi+Vk+e. Moreover, since / G AMOS-A;, we also 
have that Pr(y) > P- Therefore, the ratio p = Pr(y)/ Pr(yj satisfies 

p < p 1/k+£ . (3) 

On the other hand, note that by applying the union bound to the k + 1 events T, Ui=i <%> we § et 



Pr(AA') < (l-Prt^+mi-Pr^Dj 



< 1 - Pj ■ \\Pi + k ■ 5, 

where the last inequality follows by the fact that each is a (5, 2t + l)-secure subpath, thus the events 
77 x , % 2 are independent for every ii, 12 G {1, A; + 1} (since the distance between any two nodes u G 
and v G T i2 is at least 2t + 1). This implies that Pr(y) > pj • Yii^jPi -k-S . Since Pr(X) < Yii+jPi 
(by the independence of the events Ti 1 ,Ti 2 , for every i\,t2 £ {1, ...k + 1}), it then follows that the ratio p 
satisfies 

P > ff^ 

k ■ 5 

> Pj ~ Yi >Pj ~k- 5/p , (4) 

where the last inequality follows by the fact that / G AMOS-A; and thus Yii^j Pi — P r (3^) > P- Finally, note 
that pj > p l l k . This follows since pj > pi for every i G {1, At + 1}, so > Yltyj Pi — P- By Eq. (jjj), 
we then have that p > p x l k — k ■ 5/p. Combining this with Eq. we get that p 1 ^ — k ■ 5/p < p 1 / fe+e , 
which is in contradiction to the definition of 5 in Eq. (|2j). | 

Finally, we show that the Bk (t) hierarchy does not capture all languages even for k = oo and t as large 
as o(n). The proof of the following theorem is deferred to the Appendix. 

Theorem 3.3 There is a language not in B^ (t), for every t = o(n). 
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4 A sharp threshold between determinism and randomization 



It is known |[T3l that beyond the threshold p 2 + q = 1, randomization does not help. This result however 
holds only for a particular type of languages, called hereditary, i.e., closed under inclusion. In this section, 
we provide one more evidence supporting our belief that the threshold p 2 + q = 1 identified in lfX3Tl holds 
for all languages, and not only for hereditary languages. Indeed, we prove that, restricted to path topologies 
and finite inputs, every language C for which there exists a (p, g)-decider running in constant time, with 
p 2 + q > 1, can actually be decided deterministically in constant time. 

Theorem 4.1 Let £ be a distributed language restricted to paths, with a finite set of input values. If C G 
Bi(0(l)), thenC G LD(0(1)). 

Proof: Let C G Pi (0(1)) be a distributed language restricted to paths, and defined on the (finite) input set 
E. Consider a distributed (p, g)-decider A for C that runs in t = 0(1) rounds, with p 2 + q > 1. Fix a 
constant S such that 0<<5<p 2 + g — 1. 

Given a subpath S of a path P, let us denote by Si (respectively, S r ) the subpath of P to the left (resp., 
right) of S, so that P = Si o S o S r . 

Informally, a collection of three paths P, P', and P" (of possibly different lengths) is called a X-path 
triplet if (1) the inputs of those paths agree on some "middle" subpath of size at least A, (2) paths P and P" 
coincide on their corresponding "left" parts, and (3) paths P' and P" coincide on their "right" parts. See Fig- 
ure|4] Formally, a X-path triplet is a triplet [(P, S,x), (P', S',x'), (P", S",x")] such that \P\, \P'\, \P"\ > X, 
x,x',x" are inputs on these paths, respectively, and S C P, S' C P', S" C P" are three subpaths satisfying 
(1) \S\ = \S'\ = \S"\ > X, (2) x s = x' s , = x" s „, and (3) x^„ = x 5; and x" s „ = x' s ,. The proof of the 
following claim is deferred to the Appendix. 

Claim 4.2 Let [(P, S, x), (P', S', x'), (P", S", x")\ be a X-path triplet. If X > 1(5), for I as defined in 
Eq. iil then ((P,x) G C and (P',x') G c) => (P",x") G C. 



We now observe that, without loss of generality, one can assume that in all instances (P, x) of C, the 
two extreme vertices of the path P have a special input symbol <g>. To see why this holds, let <g) be a symbol 
not in S, and consider the following language £' defined over E U {®}- Language C consists of instances 
(P, x) such that (1) the endpoints of P have input ®, and (2) (P',x') G £, where P' is the path resulting 
from removing the endpoints of P, and where x^ = X^ for every node v of P' . Any (p, q) decider algorithm 
for C (resp., £), can be trivially transformed into a (p, q) decider algorithm for £ (resp., C) with the same 
success guarantees and running time. Hence, in the remaining of the proof, we assume that in all instances 
(P, x) G C, the two extreme vertices of the path P have input ®. 

We say that a given instance (P, x) is extendable if there exists an extension of it in C, i.e., if there exists 
an instance (P',x') G C such that P C P 1 and x' p = x. The proof of the following claim is deferred to the 
Appendix. 

Claim 4.3 There exists a (centralized) algorithm X that, given any configuration (P, x) with \P\ < 21(5) + 
1, decides whether (P, x) is extendable. Moreover, algorithm X uses messages of constant size. 



We may assume, hereafter, that such an algorithm X, as promised by Claim 4.3 is part of the language 
specification given to the nodes. We show that C G LD(0(t)) by proving the existence of a deterministic 
algorithm V that recognizes C in 0(t) rounds. Given a path P, an input x over P, and an identity assignment 
Id, algorithm V applied at a node u of P operates as follows. If x u = ® then u outputs "yes" if and only 
if u is an endpoint of P. Otherwise, i.e., if x u ^ {(£>}, then u outputs "yes" if and only if (B u ,xb u ) is 
extendable (using algorithm X), where B u = B(u, £(5)) is the ball centered at u, and of radius £(5) in P. 
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Algorithm V is a deterministic algorithm that runs in 1(5) rounds. We claim that Algorithm V recognizes 
C. To establish that claim, consider first an instance (P, x) G C. For every node u, (P, x) G C is an 
extension of (B u ,xb u )- Therefore, every node u outputs "yes", as desired. Now consider an instance 
(P, x) $l C. Assume, for the purpose of contradiction, that there exists an identity assignment Id such that, 
when applying V on (P, x, Id), every node u outputs "yes". 

Claim 4.4 In this case, \P\ > 21(5) + 1. 

Proof: Assume by contradiction that \P\ < 21(5) + 1, and consider the middle node s of P. Since s 
outputs "yes", it follows that (P,x) can be extended to (P',x') such that (P',x') G £. However, since the 
extremities of P output "yes", it means that their input is ®. Therefore, as \P'\ > \P\, we get that there is 
an internal node of P' which has input ®, contradicting (P', x') G C. | 

Let S C P be the longest subpath of P such that there exists an extension (P',x') of (S,x$), with 
(P f , x') G C. Since \P\ > 21(5) + 1, and since the middle node of P outputs "yes", we have |5| > 2£(5) + 1. 
The proof carries on by distinguishing two cases for the length of S. 

If S = P, then (P,x) can be extended to (P',x') G C. By the same arguments as above, since each 
extremity w of P has input (g), we conclude that P = P' , with x = x'. Contradicting the fact that (P, x) ^ C. 
Therefore 21(5) + 1 < 1 5| < \P\. Let a and b be such that S = [a, b] . As S is shorter than P, it is impossible 
for both a and b to be endpoints of P. Without loss of generality, assume that a is not an endpoint of P. 
Since a outputs "yes", there exists an extension (P",x") G C of (B a ,xs a )- In fact, (P",x") is also an 
extension of Xj a a+ £/g\y Since x' and x" agree on [a, a + £(6)], and since both (P' , x 1 ), and (P", x") are in 



C, we get from Lemma 4.2 that Xr a _ lj6 i can be extended to an input (P"', x'") G C, which contradicts the 



choice of S. The theorem follows. | 



5 On the impossibility of boosting 



Theorems |3.1 1 and 3.3 demonstrate that boosting the probability of success might be doable, but only from 
(p, q) satisfying pW/t^+i) + g > 1 to (p, q) satisfying p 1+1 / k + q > 1 (with the extremes excluded). In 
this section, we prove that once the inputs may be restricted in certain ways, the ability to boost the success 
probability become almost null. More precisely, recall that so far we considered languages as collections 
of pairs (G, x) where G is a (connected) n-node graph and x G E n is the input vector to the nodes of G, 
in some finite of infinite alphabet S, that is, x(v) G £ for all v G V(G). An instance of an algorithm 
A deciding a language C was defined as any such pair (G, x). We now consider the case where the set of 
instances is restricted to some specific subset of inputs Ic S" That is, the distributed algorithm A has now 
the promise that in the instances (G, x) admissible as inputs, the input vector x is restricted to x G T C S n . 

We define the classes C r (t) in a way identical to the classes Bk(t), but generalized in two ways. First, 
the parameter r is not bounded to be integral, but can be any positive real. Second, the decision problems 
under consideration are extended to the ones in which the set of input vectors x can be restricted. So, in 
particular, Bk(t) C Ck(t), for every positive integer k, and every function t. The following theorem proves 
that boosting can made as limited as desired. The proof of this theorem is deferred to the Appendix. 

Theorem 5.1 Let r < r' be any two positive reals. Then, C r '(0) \ C r (t) ^ for every t = o(n). 



Note that Theorem 5.1 demonstrates not only the (almost) inability of boosting the probability of success 
when the inputs to the nodes are restricted to specific kinds, but also the inability of derandomizing, even 
above the threshold p 2 + q = 1. Indeed, the following is a direct consequence of Theorem 



5.1 



Corollary 5.2 For every positive real r, there is a decision problem in C r (0) which cannot be decided 
deterministically in o(n) rounds. 
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APPENDIX 



A Some proofs 



Proof of Fact 2.2 : Fix any identity assignment Id and let S C P be a subpath of length d > £(5, A), where 
S = (y\, . . . , Vd)- In what follows, we may override the nodes with their indices in S, namely i. Let 
r( = [~(A — 1) /2] . Given a vertex V{ G S, for « € (re, d — re), let Bi be the subpath of its re neighborhood 
in S*. Formally, Bi = [i — re, i + re]. In addition, let B{ be the event that all nodes in Bi say "yes" when 
applying A on P. That is B { = £(P, x, Id, F Let 

i?= (r £ + t+l,d-r £ -t) 

be the ranges of indices i whose re neighborhood Bi is internal in S, i.e., Bj C [t + 1, d — t — 1]. That is 
by definition, for every i € R, it holds that i?i is at least of length A and is internal in S. See Fig. [T] for 
illustration. We would like to show that there exists i* G R, such that Bi* is a (<5, A)-secure subpath, i.e., 
that Pr[Bi] > 1 — 5. 

To establish the existence of i* G i?, we bound from above the size of the set % containing all indices 
j G R whose Bj is not a (6, A)-secure subpath. Formally, define 

U = {j G R I Pv[Bj] <l-5}. 

We next upper bound the size of %, by covering the integers in R by at most 

Q = 2(t + r/) + 1 

sets, each being a Q-independent set. That is, every two integers in the same set, are at least Q apart in P. 
Let J\,..., Jq be the Q-independent sets that cover the indices of R. 

Specifically, for s G [1, Q] and m(S) = \(d — 4(r^ + t))/Q], we define 

J s = {s + re + t + l+j-Q\je [0, m(5)]} n [1, n]. 

Observe, that as desired, R C U s e[i Q] ^s, and for each s G [1, Q], J s is a Q-independent set. In what 
follows, fix s G [1, Q] and let = J s . Let J 1 = J C\1-L, the indices in ^7 whose neighborhood does not 
correspond to (5, A)-secure subpath. On the one hand, since to a (P, x) G C, we have that 



Pr 



> p. 



(A.l) 



On the other hand, observe that for every two indices i\, 12 G J' , it holds that the distance between every 
u G Bi x and v G Bi 2 is at least 2t + I (since J' is Q-independent). Hence, the events Bi L and Bj 2 are 
independent. We therefore get that 



Pr 



Us. 

iej' 



Y[PT[Bi]<(l-8)W, 

ieJ' 



(A.2) 



where the last inequality follows by the fact that J' C %. Combining Eq. and \K.2\ we get that 

\J'\ <logp/log(l-5). 



i 



We are now ready to upper bound the size of %. Noting that S can be covered by an union of Q sets J s each 
of which is Q-independent, we get that 



Q 



\n\ 



£ltfl<Q 



i=i 



logp 
log(l-5) : 



(A3) 



where J{ = J { n H. Finally note that \R\ > \H\. This follows by the fact that \R\ = \S\ - 2(r e + t + 1) 
and \S\ > 1(8, A). Hence, 



= \S\-2(r e + t + l) 

,X)-2(r e + t + l) 
logp 

Mi - 5) 

HI, 



> *( 

> Q 
> 



where the last inequality follows by Eq. ( A.3 1. By the pigeonhole principle we get that there exists i* G R 
which is not in %. Thus Bi* is a (5, A)-secure subpath, as required. The fact follows. 



Proof of Theorem 3.3 : We exhibit one specific language not in B^t), for every t = o(n). This language 



consists of determining whether the underlying network is acyclic. Specifically, let 

Tree = {(G, e) | G is a tree}, 

where e is the null input. Fix a function t = o(n). Assume, towards contradiction, that there exists some 
finite k such that Tree G B^(t). Then there is a (p, g)-decider for Tree, given by ^4, running in t rounds, 
with p 1+1 / k + q — 1 > 0. Hence, in particular, there exists some e > such that p + q — 1 > e. Define 



We consider graphs G of size n > 



21-logp 
log(l-S) 



f < 



£ = £<p + g— 1. 
. We will show that Tree ^ By.(t) for any 
log(l - 8) 



(A.4) 



21 • logp 



n 



o(n). 



(A.5) 



Consider the cycle C with n nodes labeled consecutively from 1 to n, and the path P with nodes labeled 
consecutively from 1 to n. This labeling defines the identity assignment Id 1 . In the input configuration 
(P, e), the probability that all nodes say "yes" when executing A is at least p. Let us identify a subpath 
S = [x — t, . . . , x + t + 1] of P to be used as an internal (8, 2(t + l))-secure subpath in P. I.e., 



Pr[£(P,e,ld\V(S))} > 1-5. 



(A.6) 



Note that, by Eq. ([T| |A.4|[a!5| ), it follows that n > 1(8, 2(t + 1)). Hence by Fact|T2j since (P, e) G C, there 
exists such internal subpath S C P. Consider the event £(P, x, Id, V(S)) stating that all nodes in subpath S 
of P with input x and identity-assignment Id return "yes". We have that 



PT[S(P,e,U 1 ,V(S))} = Pv[£(C,e,ld\V(S))} . 



(A.7) 



Consider a subpath S composed of the subpath S padded with a block Loft nodes before it and a block R 
of t nodes after it. Indeed, since S is an internal subpath of P (i.e., it is at distance at least t+1 from P's 
endpoints), the set of nodes ofS = LoSoR = [x — 2t, . . . , x + 2t + 1] appears consecutively in both P 



n 



and in C with identity assignment Id 1 , and S have the same identities (with Id 1 ) and degrees in both C and 
P. We now consider another identity-assignment Id 2 for P, with nodes labeled consecutively from x + 1 to 
n, and then from 1 to x. Consider the (n — 2(t + l))-node subpath 

S' = [x + 1 + 2, . . . , n, 1, . . . , x — t — 1] . 

We have 

Pr[£(P,e,Id 2 ,F(S'))] = Pr[£(C,e,Id 1 ,F(S"))] , (A.8) 

Consider a subpath S' composed of the subpath S' padded with a block L' of t nodes before it and a block 
B! of f nodes after it, i.e., S' = L' o S' o R 1 . Indeed, the set of S' nodes appears consecutively in both C 
and P with identity assignment Id 2 , and L'oS'o R' have the same identities (with Id 2 ) and degrees in both 
C and P, where V (resp., R') is the subpath composed of the t nodes with identities Id 2 immediately larger 
than x + t + 1 (resp., smaller than x — t). Formally, we have that Id~, = Id-,. See Fig. j3jfor illustration. 
Let S = £{C, e, Id 1 , V(S)), (resp., S' = £(C, e, Id 1 , V(S'))) be the event that all nodes of S C C (resp., 
5' C C) say "yes". We can now combine these previous results to derive a contradiction. Since C ^ Tree, 



by applying the union bound on the events S and S' , and using Eq. (A.7i and (A.8 1, we get that 



q < l-Pr[£(C,e,ld\V(C))} 

< (1 - Pt[S']) + (1 - Pr[5]) 
= {l-Pr[£(P,e,ld 2 ,V(S'))]) 

+ {l-Pv[£(P,e,ld\V(S))]) 

< {l-Pi[£(P,e,ld 2 ,V(S'))})+5 

where the last inequality holds by Eq. < \A.6} . Therefore we get 

q < 1—p + S, 



by noticing that Pr[£(P, e, Id 2 , V(S'))] > p since P G Tree. Finally, by Eq. (A.4), we eventually get 
q < I — p + p + q — 1 or q < q, contradiction. | 



Proof of Claim 4.2: Consider an identity assignment Id" for (P",x"). Let Id and Id' be identity assign- 
ments for (P,x), and (P',x'), respectively, which agree with Id" on the corresponding nodes. That is: (a) 
assignments Id, Id', and Id" agree on the nodes in S, S' and 5"', respectively; (b) Id and Id" agree on the 
nodes in Se and S'J, respectively; and (c) Id and Id" agree on the nodes in S' r and S", respectively. Since 



(P, x) G C, and since \S\ = A > i(S), it follows from Fact 2.2 that 5 contains an internal <5-secure subpath 
H. Then, let H' and H" be the subpaths of P' and P" corresponding to H. Since S and S" coincide in 
their inputs and identity assignments, then H, H' , H" have the same t-neighborhood in P, P', P" respec- 
tively. Hence, H" is also a 5-secure (when running algorithm A in instance (P", x")). Since both (P, x) and 
(P', x') belong to C, we have 

Pr[£ (P;',Id",x") = Pt[£(H £ ,U,x)])] > p 

and 

Pr[W,',Id",x") = Pr[£(H' r ,Id',x')})} > P- 
Moreover, as \H"\ > 2t + 1, the two events £{H' £ ', Id", x") and £{H' r !, Id", x") are independent. Hence 

Pr[£(#;'uP;,Id",x")] >p 2 . 
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In other words, the probability that some node in H'J U H" says "no" is at most 1 — p 2 . It follows, by 
union bound, that the probability that some node in H" says "no" is at most 1 — p 2 + 5 < q. Since A is a 
(p, g)-decider for C, it cannot be the case that (H" , x") C. I 



Proof of Claim 4.3 : Observe that since the running time t is constant, then 2i{5) + 1 is also constant. 
Therefore, there are only finitely many configurations (P,x) with \P\ < 2£(5) + 1 (since £ is finite). Call 
this set of configurations C. Each of the configurations in C is either extendable or not. Hence, there exists a 
function / : C — > {0, 1} such that for every configuration C G C, /(C) = 1 if and only if C is extendable. 
This function / can be described in a finite manner, and hence gives rise to an algorithm as required by the 
claim. | 



Proof of Theorem 5.1: Let r = a/b G [r,r') be a positive rational where a and b are two co-prime 
integers. By the density of the rational numbers, such r is guaranteed to exist. To establish the theorem, we 
consider the language AMOS -a restricted to instances in X, where 

X = {x G {0, 1}* : || x Hi £ [a + 1, a + b - 1]}. 

In other words, the promise says that an input either satisfies AMOS -a, or is far from satisfying AMOS -a 
(very many selected nodes). We prove that AMOS-a G C r >(0) \ C r (t) for every t = o(n). 

We begin by showing that AMOS-a G C r > (0) by considering the following simple randomized algorithm 
that runs in time: every node v which is not selected, i.e., such that x(v) = 0, says "yes" with probability 
1; and every node which is selected, i.e., such that x(v) = 1, says "yes" with probability p l / a , and "no" 
with probability 1 — p l l a . If the graph has s < a nodes selected, then all nodes say "yes" with probability 
ps/a y as deseed Else, there are s > a + b leaders, (this follows from the promise), and at least one 
node says "no" with probability 1 — p s l a > 1 — pi a+b )/ a = 1 — p 1+1 / r . We therefore get a (p, g)-decider 
with p 1+1 l ? + q > 1, thus p 1+1 / r ' + q > 1 as r' > r. It therefore follows that AMOS-a G C r > (0). 

We now consider the harder direction, and prove that AMOS-a ^ C r (t), for any t = o(n). Since r>r, 
it is sufficient to show that AMOS-a ^ C?{t). To prove this separation, consider the AMOS-a problem 
restricted to the family of n-node paths. Fix a function t = o[n), and assume, towards contradiction, that 
there exists a distributed (p, g)-decider A for AMOS-a that runs in 0(t) rounds, with p l+l / r + q > 1. Let 
e G (0, 1) be such that p 1 + 1 / r + £ + g > 1. Let P be an n-node path, and let S C P be a subpath of P. Let 
5 G [0, 1] be a constant satisfying 

< 5 <p l+l/? (1 -p £ )/{a + b- 1). (A.9) 

Consider a positive instance and a negative instance of AMOS-a, respectively denoted by 

I = (P,x) and I' = (P,x'). 

Both instances are defined on the same n-node path P, where 

n> (a + b-1) (£{5) + 1) + 1. 

where £(5) = £(5, 2t + 1), as defined by Eq. ([!]). We consider executions of A on these two instances, where 
nodes are given the same id's. Both instances have almost the same input. In particular, the only difference 
is that instance / contains a selected nodes, whereas /' has the same selected nodes as / plus b additional 
selected nodes. Therefore / is legal, while I' is illegal. In addition, both inputs x and x' satisfy the promise. 

In I', the path P is composed of a + b sections, each containing a unique selected node, and where each 
pair of consecutive sections separated by <5-secure subpaths. More precisely, let us enumerate the nodes of 



iv 



P from 1 to n, with node v adjacent to nodes v — 1 and v + 1, for every 1 < v < n. Consider the a + b — 1 
subpaths of P defined by: 

Si = [(i-l)£(S) + i + l,i-£(S) + i] 

for i = {1, . . . , a + b — 1}. Let the selected nodes in /' be positioned as follows. Let u\ = 1 and let 

Ui = (i — l)£(S) + z for i = 2, . . . , a + 6. Then set 



x v 



1 if v = Ui for some i G {1, a + b} 
otherwise. 



Our next goal is to construct a legal input I = (P, x) with a leaders. Towards this, we begin by showing 
that each Si contains a 5-secure internal subpath S'; = [xj, yi] (internal to Si). Note that Fact 



2.2 



refers to 



subpaths in valid instances (P, x) G £, and since /' is illegal it cannot be directly applied. So instead, let 
us focus on the instance (Si,x' s .) with IDs lds z . Since Si contains no leaders, || x' s . || x =0, it follows that 



(Si , x' s _ ) G £. Now we can safely apply Fact 2.2 Indeed, since | Si \ > 1(8) it follows by the fact that Si con- 
tains an internal 5-secure subpath S[ = [x{ , yij. Therefore, when applying algorithm A on (Si , x' 5 . , Id,^ ) and 
on (P, x', Id), the nodes of % behave the same, thus Pr[£(P, x', Id, V(SQ)] = PT[S(S i ,x' s . , Id^ , V(S-))]. 
Hence, S^ is a (5-secure subpath in /' as well, for every i G {1, a + b — 1}. 

The <5-secure subpaths S^ are used to divide P into 2(a + 6— 1) + 1 segments. There are a + 6 segments 
Tj, i = 1, . . . , a + b, each with one selected nodes. The <5-secure subpaths S[ = [xi, yi] separate Tj from 
Ti+i. More precisely, we set 

Ti = [l,xi - 1], Tj = + l,xi - 1] 

for i G 2, a + 6-1, and T a+b = [y a+b + 1, n], getting 

P = Ti o S[oT 2 o S' 2 ° ■ ■ ■ ° T a+b _i o S'^ +6 _ 1 o T a+b 

where o denotes path concatenation. For i G {1, a + b}, let % = £(P, x' , Id, V(Ti)) be the event that all 
nodes in the subpath Ti say "yes" in instance i 7 , and let p. t = Pr [71] its probability. Let J = {j\ , . . . , j b } be 
the set of b indices with maximal values in {pi, . . . ,p a+b }. I.e., pj > max{j?j | % G {1, a + 6} \ J} for 
every j G J. We are now defining the valid instance / = (P, x): 



x(v) 



1 if v = Ui for some i £ {1, ...,a + b}\ J 
otherwise. 



We therefore have that || x' \\ x = a + b and || x \\ 1 = a, thus I G AMOS-a while I' AMOS-a, and both 
I, I' satisfy the promise. We now make the following immediate observation. 

Claim A.1 Pr[£(P, x, Id, V(Ti))] = Pi , for every i £ J. 

This follows since the distance between any two nodes u (resp., v ) in distinct T[s is greater than t, which 
implies that x(Li o Ti o Rj) = x'(Li o Tj o Rj) where Li (resp., Ri) is the subpath of length t to the left 
(resp., to the right) of in P, from which it follows that under A the nodes of Tj have the same behavior in 
both instances I and /' for every i ^ J. | 

Let Af (resp., M') be the event that there exists at least one node in I (resp., /') that says "no" when 
applying algorithm A. Similarly, let y (resp., y') be the event that all nodes in the configuration I (resp., i 7 ) 
say "yes". Let T = Ui=i ^ ^ e tne event tnal: a U nodes in the subpaths T, for i G {1, o + b} say "yes" 
in the instance i 7 . For every i G {1, a + b — 1}, let Si = £(P,x', Id, V(Sj)) be the event that all nodes 
in the <5-secure subpath S^ say "yes" in the instance I'. We have 

PrQ>) =Pv[£(P,x,ld,V(P))\ 



v 



and 

Pr(y) = Pr[£(P,x',ld,V(P))} 

while 

Pr(AA) = 1 - Pr(^) 

and 

Pr(AA') = l-Pr(y). 
Since A a (p, g)-decider, as we assume by contradiction that AMOS-a 6 B^, we have 

Pr(/V') > q, 

and thus 

Pr(AA') > l- V 1+1 l ?+£ . 

Therefore, Pr(y) < p 1+1 / r+£ . Moreover, since / € AMOS-a, we also have that Pr(y > p. Therefore, 

^<P"-. <A.K» 
On the other hand, by applying the union bound to the a + b events T, U^i 1 <%> we § et tnat 

a+b-l 

Pr(/V') < (1 - Pr[7l) + ^ (1 - Pr[<Si]) 

i=l 

< l-(HP' t -llPj) + (a + b-l)-6, 

i$J jeJ 

where the last inequality follows since S^'s are <5-secure subpaths and hence the events 7?s are independent. 
We therefore get that 

Pr(y) > [Hp, ■ Y[ Pj J - (a + b - 1) • 6 . 
Since Pr(y < Yl^jPi, it then follows that 

pr(y) n^jp«-rijgjPj-(Q+fc-i)^ 
pr(y - n^jK 

Now, since / € AMOS-a, we have Yii^jPi — P r (3^) > P, and thus 



Note that 



n^>^ i/f - (a. i2) 



vi 



By the definition of J,\J\ = b, and Ylj e jPj > Pi for every i ^ J. In addition, since there are a indices 

i J, we get that (Y\j £ j Pj) a ^ b > Yli^jPi > Pr(^V) > P- Combining with the definition of r, Eq. (A. 12 1 
follows. Hence, by Eq. ( |A.11[ ), we get 



Pr(y)/Pr(y)>^- (Q + 6 - lM . 

V 



Combining with Eq. (A. 10 1 we get that 



P 



1/r - (a + b - 1) • S/p < p 1/r+e 



which is in contradiction to the definition of 5 in Eq. (A.9). We therefore get that AMOS-a ^ Cf(i), and 
since r < r, it also holds that AMOS-a ^ C r (t) as required. The theorem follows. 1 
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Figure 1: Presented is path P. The green £(5, A) -length segment corresponds to S. The red subpath Bi 
whose center is vi is a (8, A)-secure subpath candidate, as i G P. 
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Figure 2: Illustration of the constructions for Theorem 3.1 (a) The instance I' = (P,x') with k + 1 
leaders separated by £(5)-length segements, S*. (b) The J-secure subpaths Si- in each are 
internal to Si. (c) The leader-segments Ti interleaving with 5-secure subpaths (d) The legal 
instance I = (P,x), the j" 1 leader of 7' is discarded, resulting in a & leader instance. 
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Figure 3: Illustration of the constructions for Theorem 3.3 Shown are paths P\,P2 and cycle C. When 
applying Algorithm A on path Pi (respectively, P2) and on cycle C, the nodes in the segment [x—t, x+t+1] 
(resp., [x + 1 + 2, . . . , 1, . . . , x + t + 1]) behave the same. 
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Figure 4: Example of a A-path triplet (the red zone is of length at least A). 
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